Method for processing physical document images

ABSTRACT

A method for performing image processing of a physical document image. The method may include receiving an image file including an image of a physical document and a control target. The method may include measuring control target attributes of the control target. Measuring the control target attributes of the control target may include numerically measuring a degree of focus of the image file based on the control target. The method may include performing a static crop of the image file based on the control target attributes by removing the control target from the image file to create a cropped image.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation application, which claims the benefitof:

-   -   (1) U.S. patent application Ser. No. 14/216,489, filed Mar. 17,        2014, which claims the benefit under 35 U.S.C. § 119(e) of U.S.        Provisional Application No. 61/786,886, filed on Mar. 15, 2013,        and U.S. Provisional Application No. 61/824,834, filed May 17,        2013; and    -   (2) U.S. patent application Ser. No. 14/216,407, filed Mar. 17,        2014, which claims the benefit under 35 U.S.C. § 119(e) of U.S.        Provisional Application No. 61/786,886, filed on Mar. 15, 2013,        and U.S. Provisional Application No. 61/824,834, filed May 17,        2013;        U.S. patent application Ser. Nos. 14/216,489 and 14/216,407 and        U.S. Provisional Application Nos. 61/786,886 and 61/824,834 are        herein incorporated by reference in their entirety.

FIELD

Embodiments of the invention are directed generally toward a method forprocessing physical document images captured by an apparatus.

SUMMARY

Accordingly, an embodiment includes a method for performing imageprocessing of a physical document image. The method may includereceiving an image file including an image of a physical document and acontrol target. The method may include measuring control targetattributes of the control target. Measuring the control targetattributes of the control target may include numerically measuring adegree of focus of the image file based on the control target.Numerically measuring the degree of focus of the image file based on theat least one control target may include: determining a location of alight-dark pattern area of the control target; determining pixel valuesof the light-dark pattern area; determining a ratio of dark to lightpixels for each of a plurality of segments of the light-dark patternarea; determining which of the plurality of segments of the light-darkpattern area exceed a predetermined dark-to-light pixel ratio threshold;and determining the degree of focus based on a number of segments thatexceed the predetermined dark-to-light pixel ratio threshold. The methodmay include performing a static crop of the image file based on thecontrol target attributes by removing the control target from the imagefile to create a cropped image.

Additional embodiments are described in the application including theclaims. It is to be understood that both the foregoing generaldescription and the following detailed description are exemplary andexplanatory only and are not restrictive. Other embodiments of theinvention will become apparent.

BRIEF DESCRIPTION OF THE FIGURES

Other embodiments of the invention will become apparent by reference tothe accompanying figures in which:

FIG. 1A shows a view of an exemplary imaging station configured to imagephysical documents;

FIG. 1B shows a side view of a frame of the exemplary imaging stationdepicted in FIG. 1A;

FIG. 1C shows an isometric view of a frame of the exemplary imagingstation depicted in FIG. 1A;

FIG. 1D shows an exploded view of a frame of the exemplary imagingstation depicted in FIG. 1A;

FIG. 2A shows a partial view of a further exemplary imaging stationconfigured to image physical pages of a book;

FIG. 2B shows a further partial view of the further exemplary imagingstation depicted in FIG. 2A;

FIG. 2C shows an additional partial view of the further exemplaryimaging station depicted in FIG. 2A;

FIG. 2D shows an additional partial view of the further exemplaryimaging station depicted in FIG. 2A;

FIG. 2E shows a partial view of the further exemplary imaging stationdepicted in FIG. 2A;

FIG. 3A shows a view of an additional exemplary imaging stationconfigured to image microfiche cards;

FIG. 3B shows a partial view of the additional exemplary imaging stationdepicted in FIG. 3A;

FIG. 3C shows a further partial view of the additional exemplary imagingstation depicted in FIG. 3A;

FIG. 3D shows an additional partial view of the additional exemplaryimaging station depicted in FIG. 3A;

FIG. 4A shows an exemplary vacuum insert assembly of some embodiments;

FIG. 4B shows components of the exemplary vacuum insert assemblydepicted in FIG. 4A;

FIG. 4C shows a partial cross-sectional view of the exemplary vacuuminsert assembly depicted in FIG. 4A;

FIG. 4D shows a top view of an exemplary control strip 470 suitable foruse with some embodiments;

FIG. 5A shows a view of a further exemplary imaging station configuredto image physical pages of a book;

FIG. 5B shows a partial view of the further exemplary imaging stationconfigured to image physical pages of a book depicted in FIG. 5A;

FIG. 5C shows a further partial view of the further exemplary imagingstation configured to image physical pages of a book depicted in FIG.5A;

FIG. 5D shows an additional partial view of the further exemplaryimaging station depicted in FIG. 5A;

FIG. 5E shows a further partial view of the further exemplary imagingstation configured to image physical pages of a book depicted in FIG.5A;

FIG. 5F shows another partial view of the further exemplary imagingstation depicted in FIG. 5A;

FIG. 6A shows a partial exploded view of a book vacuum chamber 590 ofthe further exemplary imaging station, which is depicted in FIG. 5A;

FIG. 6B shows a cross-sectional view of a partially opened platen cover580 and the book vacuum chamber 590 of the further exemplary imagingstation, which is depicted in FIG. 5A;

FIG. 6C shows a cross-sectional view of a closed and sealed platen cover580 and the book vacuum chamber 590 of the further exemplary imagingstation, which is depicted in FIG. 5A;

FIG. 7A shows an exemplary system topology diagram of an exemplaryembodiment of the invention;

FIG. 7B shows a further exemplary system topology diagram of anexemplary embodiment of the invention;

FIG. 8 shows a flow diagram of an exemplary initial imaging process;

FIG. 9 shows a flow diagram of a further exemplary imaging process;

FIG. 10 shows a flow diagram of an exemplary automated quality controlprocess and a manual quality control process;

FIGS. 11A-D shows exemplary depictions associated with performing coarseimage measurements of some embodiments;

FIGS. 12A-J shows exemplary depictions associated with performing fineimage measurements of some embodiments;

FIGS. 13A-F shows exemplary depictions associated with performingautomated image corrections of some embodiments;

FIGS. 14A-E shows exemplary depictions associated with performing coarsedocument isolation of some embodiments; and

FIGS. 15A-G shows exemplary depictions associated with performing finedocument isolation of some embodiments.

DETAILED DESCRIPTION

Reference will now be made in detail to the subject matter disclosed,which is illustrated in the accompanying drawings. The scope of theinvention is limited only by the claims; numerous alternatives,modifications, and equivalents are encompassed. For the purpose ofclarity, technical material that is known in the technical fieldsrelated to the embodiments has not been described in detail to avoidunnecessarily obscuring the description.

Embodiments of the invention include an apparatus, a system, and amethod for digitally imaging physical media or physical documents (suchas archival sheets, books, or microfiche cards), performing imageprocessing, and/or creating a digital document. Some embodiments of theinvention may include one or more portable imaging stations for imagingdocuments. Some embodiments also include instructions stored in acomputer readable medium which are configured to perform a method ofimage processing of the images received from one or more imagingstations; the instructions stored in a computer readable medium may beimplemented as a computer program product which may include a userfriendly graphical user interface. Some embodiments of the invention areconfigured to digitally image physical documents (such as archivaldocuments (e.g., sheets, books, or microfiche cards)), create one ormore digital documents, perform one or more digital documentmanipulations, perform one or more image processing operations, performone or more optical character recognition (OCR) operations, or performseparation, combination, association, or manipulation operations amongtwo or more data sources associated with features of embodiments, orperform other operations. For example, embodiments are configured toimage a physical document and create an associated searchable text datastructure associated with the digital image of the physical document.

Embodiments are configured to prevent errors (e.g., two images of a samephysical document sheet or skipped sheets of a physical document). Someembodiments are configured to reduce vibration of the physical mediaduring the imaging process and to increase stability. Some embodimentsare configured to preserve the integrity of the physical documents andto prevent damage. Embodiments allow for quick and simple imaging whichcreate easily organized, high quality digital document archives withsearchable text fields.

Some embodiments include one or more imaging stations, where the imagingstation includes one or more digital cameras, a document supportingmechanism or surface, one or more displays, one or more lights, an imageactivation mechanism, one or more computing devices, or the like.Additionally, some embodiments include one or more transparent vacuuminsert assemblies, one or more vacuum or compressed air systems, or thelike. The transparent vacuum insert assembly can be configured tostabilize physical documents by providing a vacuum effect to pull orhold the physical document flat against a transparent image surface. Insome implementations, elements of the embodiments of the invention arecommunicatively coupled to one another. For example, a computing deviceand camera of an imaging station are communicatively coupled, and thecomputing device of the imaging station is communicatively coupled to anetwork.

Referring now to FIGS. 1A-1D, an exemplary imaging station 100 isdepicted. Exemplary imaging station 100 is configured to image physicaldocuments 160 (such as archival sheets). Some embodiments of the imagingstation 100 include one or more digital cameras 110, one or more lightpanels 120, one or more displays 130, one or more activation buttons 140configured to trigger the camera 110 to capture an image of a physicaldocument, a document imaging area 190 (which may include a transparentvacuum insert assembly, such as depicted in FIGS. 4A-4C), an activationpedal 150, a compressed air connection 180, or the like. Someembodiments of the exemplary imaging station 100 include a modular framestructure with members configured to support elements of the imagingstation. The frame can include light panel supports 121, displaysupports 131, camera supports 111, other support members 181, or thelike.

Referring now to FIGS. 2A-2E, an exemplary imaging station 200 isdepicted. Exemplary imaging station 200 is configured to image books260. Some embodiments of the imaging station 200 include one or moredigital cameras, one or more light panels, one or more displays, one ormore activation buttons 240 configured to trigger the camera to capturean image of a physical document, a document imaging area 290 (which mayinclude a transparent vacuum insert assembly, such as depicted in FIGS.4A-4C), an activation pedal, or the like. Embodiments may additionallyinclude a camera support 250. Embodiments may include a vacuum assembly270 configured to create a lower differential pressure (e.g., a vacuum)within a transparent vacuum insert assembly to pull a page of a book 260flat against the document imaging area 290. In some implementations, thevacuum assembly operates based upon the Venturi principle by flowingcompressed air through a tee-apparatus, which includes an opening, asmaller opening 272, and side tee-branched tube 271; this effect createsa lower pressure (i.e., vacuum effect) in the tube 271; the tube 271connects to the transparent vacuum insert assembly to create a vacuumeffect which stabilizes a sheet of paper. In some embodiments, thedocument imaging area 290 includes one or more apertures 291 (such aselongated parallel apertures, round apertures, square apertures, or thelike) as part of a transparent vacuum insert assembly. Additionally, theexemplary imaging station 200 includes a slide mechanism 220, whereby aportion of the apparatus can slide on a track to easily facilitateturning a page without removing the book from book support members(e.g., 210, 230) of the imaging station.

Referring to FIG. 2D-2E, some embodiments include a lift cylinderassembly 280 configured to raise or lower a book support 230. The liftcylinder assembly can be operated or powered through a compressed air(e.g., pneumatically driven) supply 281, 282. In some embodiments, thelift cylinder assembly 280 or the imaging station 200 includes a base285.

Referring now to FIGS. 3A-3D, a further exemplary imaging station 300 isdepicted. Exemplary imaging station 300 is configured to imagemicrofiche cards 360 (which may include an image film portion 361 and atext portion 362). Some embodiments of the imaging station 300 includeone or more digital cameras 310A, 310B, one or more light panels, one ormore displays 330A, 330B, one or more activation buttons 340A, 340B eachconfigured to trigger a particular camera to capture an image of aphysical document, a document imaging area, or the like. In someembodiments, the exemplary imaging station 300 is configured to capturetwo images of a microfiche card 360, where a first image is captured ofa portion of the microfiche card with the text portion 362, and whereina second image is captured of a portion of the microfiche card 360 withthe image film portion 361. For example, a user or an automatedmechanism (such as a user-initiated automated mechanism, aprocess-initiated automatic mechanism (e.g., acomputer-program-process-initiated automatic mechanism), atrigger-initiated automated mechanism, a semi-automated mechanism, or afully automated mechanism) places the microfiche card 360 on asupporting surface 321; a first camera 310B images the text portion 362of the microfiche card 360; the user or automated mechanism then slidesthe microfiche card 360 between a card supporting surface 321 and asecond surface 320 (see, e.g., FIG. 3C); and the second surface 320includes a window 320A, whereby the window 320A is configured to allowthe second camera 310B to capture an image of the image film portion 361of the microfiche card 360. Some embodiments include ball bearings 322configure to reduce friction (and reduce or eliminate potential damagethat could be caused to archival microfiche cards during imaging) ofsliding a microfiche card 360 between the supporting surface 321 and thesecond surface 320.

Referring now to FIGS. 4A-4C, views of an exemplary transparent vacuuminsert assembly 401 of an exemplary transparent vacuum insert system 400of some embodiments are depicted. In some embodiments, the transparentvacuum insert assembly 401 includes a frame 410 (which may be comprisedof metal, plastic, acrylic, wood, carbon-fiber, fiber glass, or thelike), one or more control strips 470, pins 471, an “out of gamut”substrate 420 (e.g., which may comprise a blue, green, or a magentascreen or surface), as well as other layers 430, 440, 450, 460. In someembodiments, the vacuum insert assembly 401 is a transparent open-facevacuum insert assembly configured to support, stabilize, and exertsuction on (e.g., when the vacuum is activated) a document 480.

Referring now to FIGS. 4B-4C, in some embodiments, the metal frame 410includes holes (which exemplarily may be holes having a diameter of ¼inch), wherein the holes are configured to receive a plurality of pins471. In some embodiments, the pins 471 are configured to align thevarious layers (e.g., 410, 430, 440, 450, 460), and the pins 471 areconfigured to receive (e.g. register with or align with) the holes ofthe color strip 470.

Still referring to FIGS. 4B-4C, in some embodiments, the transparentvacuum insert assembly 401 comprises three or more layers 430, 440, 450(e.g., sheets or partial sheets) of transparent acrylic. For example, abottom layer 430 may comprise a solid sheet; a middle layer 440 maycomprise an outside frame or border; and a top layer 450 may comprise aperforated sheet or sheet with a plurality of holes 451. For example,the three or more layers 430, 440, 450 may be stacked such that a hollowvoid exists is between the top layer 450 and the bottom layer 430,wherein air may be pulled or drawn through the holes of the top layer450 when a vacuum is activated.

Additionally, in some embodiments, the transparent vacuum insertassembly 401 further comprises a transparent template layer 460. Forexample, the transparent template layer 460 may include a rectangulardocument window (which may be cut out of the template layer 460). Thedocument window of the transparent template layer 460 may be of varioussizes to accommodate documents 480 of any size. When the transparenttemplate layer 460 is laid on top of the transparent vacuum insertassembly 401, the surface of the transparent template layer 460 whichborders the document window is configured to block theperforations/holes of the top layer 450 which would otherwise be exposedto open air when a document is placed within the document window; thisallows suction to pull the document flat against the top layer 450 ofthe transparent vacuum insert assembly 401 when the vacuum is activated.

Referring again to FIG. 4B, in some embodiments, the “out of gamut”substrate 420 comprises a brightly colored surface or screen, such asbright blue, bright green, or a magenta screen or surface. In someembodiments, the bright color of the “out of gamut” substrate 420improves the ability of the imaging process to visually separate thelive documents from the background of the apparatus. In someembodiments, the “out of gamut” substrate 420 is configured to residewithin the boundaries of the frame 410 and rest upon an underlyingsurface which supports the frame; this allows a particular “out ofgamut” substrate 420 to interchangeably be swapped for a differentlycolored “out of gamut” substrate without having to open the transparentvacuum insert assembly 401.

In some embodiments, the transparent vacuum insert assembly 401 isinserted within the frame 410 and on top of the “out of gamut” substrate420. Additionally, the transparent template layer 460 may be placed ontop of the transparent vacuum insert assembly such that the holes alongthe edges of the transparent template layer 460 register with pins 471.

Referring now to FIG. 4D, an exemplary control strip 470 of someembodiments is shown. Holes of control strips 470 are configured toregister with pins 471 placed along the edges of the transparent vacuuminsert assembly 401 of some embodiments or along a document imaging areaof other imaging stations of some embodiments. In some embodiments (forexample, as shown in FIG. 4A), four control strips 470 are placed alongthe sides of the transparent vacuum insert assembly 401. The controlstrips 470 may be placed in a precise location of the live imaging areaand assists the image capture and image processing process in locatingand measuring components of images. In some embodiments, each controlstrip includes a solid color bar 470A (e.g., a black bar), a color bar470B, and a light-dark pattern bar 470C. The solid color bar 470A mayhave a known length and width, which allows an imaging process measuresize and location. The color bar 470B may have a plurality of colorareas (e.g., color blocks 470B-1, 470B-2, 470B-3, 470B-4, 470B-5,470B-6, 470B-7, 470B-8, 470B-N) with each color block being a knowndifferent color at known location relative to the solid color bar 470A;for example, in a particular embodiment, the color bar 470B includes anarrangement of color blocks, which include dark gray, gray, light gray,red, magenta, yellow, green, indigo, and blue. The light-dark patternbar 470C may include a pattern of progressively tighter (e.g., coarserto finer pattern tightness) light-dark patterns. The color bar 470B hasa known location relative to the solid color bar 470A. In exemplaryembodiments, the light-dark pattern bar 470C comprises a black-whitepattern bar with a progressively tighter checker pattern (e.g., a seriesof checker block pattern sections, wherein each subsequent checker blockpattern section in a particular direction along the light-dark patternbar 470C has a smaller checker pattern). For example, the light-darkpattern area (e.g., the light-dark pattern bar 470C) includes aplurality of segments (e.g., 470C-1, 470C-2, 470C-3, 470C-4, 470C-5,470C-N) arranged linearly from a first side of the light-dark patternarea to an opposite side of the light-dark pattern area, wherein each ofthe plurality of segments includes a geometrically similar repeatingpattern of light pixel areas and dark pixel areas, wherein a size ofeach light pixel area and each dark pixel area of each particularsegment progressively decreases from the first side of the light-darkpattern area to the opposite side of the light-dark pattern area. Thelight-dark pattern bar 470C has a known location relative to the solidcolor bar 470A. The control strips 470 may be used as known and staticinputs (e.g., predetermined and known size, position, color, colorarrangement, and known color patterns, the like) by the imaging processto locate and measure components of the image as well as for measuringthe integrity and quality of each image captured by the camera. In someembodiments, each side of the transparent vacuum insert assembly 401 isconfigured to receive and is associated with a unique control strip 470which can be identified and utilized throughout the imaging process.

Referring now to FIGS. 5A-5F, a further exemplary imaging station 500configured to simultaneously image two pages of a book 560 is depicted.Some embodiments of the imaging station 500 include one or more digitalcameras 510, a camera hood assembly 511, one or more light panels 520,one or more displays 530, one or more activation buttons 540 configuredto trigger the camera to capture an image of two pages of a book 560, anactivation pedal, a vacuum assembly 570, a transparent platen coverassembly 580, a book vacuum chamber assembly 590, one or morepositioning lasers configured to emit a laser pattern 591 (such as across-hair pattern; a grid pattern; a pattern comprised of points,curves, and/or lines; or the like) onto a document imaging area of thebook vacuum chamber assembly 590, or the like.

In some embodiments, the vacuum assembly 570 is configured to create alower differential pressure (e.g., a vacuum) between the transparentplaten cover assembly 580 and the book vacuum chamber assembly 590 tolift a flexible sheet 593, which supports a book 560, such that twopages of the book 560 are pulled flat against the bottom surface of atransparent portion 582 (e.g., a tempered glass portion) of thetransparent platen cover assembly 580. In some implementations, thevacuum assembly 570 may operate based upon the Venturi principle byflowing compressed air through a tee-apparatus, which includes anopening, a smaller opening, and side tee-branched tube; this effectcreates a lower pressure (i.e., vacuum effect) in the tube; the tubeconnects to the book vacuum chamber assembly 590 to create a vacuumeffect. When the vacuum assembly 570 is activated, the vacuum effectdraws the flexible sheet 593 (e.g., such as a gum rubber sheet, aneoprene sheet, a nylon sheet, a Kevlar sheet, a polyester sheet, or thelike) and the book 560 resting upon the flexible sheet 582 toward andagainst the bottom surface of the transparent portion 582 of thetransparent platen cover assembly 580.

In some embodiments, the transparent platen cover assembly 580 isattached (e.g., via hinges) to one or more portions of the imagingstation 500 along a pivot axis such that the transparent platen coverassembly 580 can swing to an open position or swing to a closed orsealed position with respect to the book vacuum chamber assembly 590.When the transparent platen cover assembly 580 is in a closed or sealedposition with respect to the book vacuum chamber assembly 590, surfacesalong a frame of the transparent platen cover assembly 580 may abut atop layer 592 of the book vacuum chamber assembly 590 to create an airseal (e.g., a semi-permeable or impermeable air seal) configured tomaintain a pressure differential, when the vacuum assembly 570 isactivated, between (a) a space between the transparent platen coverassembly 580 and the book vacuum chamber assembly 590, and (b) outsideof the space between the transparent platen cover assembly 580 and thebook vacuum chamber assembly 590. In some embodiments, the transparentplaten cover assembly 580 includes a transparent portion 582 (e.g.,glass, such as tempered glass) configured to allow high-quality imagesof one or more pages (e.g., one page or two pages simultaneously) of abook 560 to be imaged by a camera. In some embodiments, the transparentplaten cover assembly 580 includes one or more counter-weights 581configured to balance (or reduce the imbalance) the weight or torque ofthe transparent platen cover assembly 580 on each side of the pivot axisso as to reduce the amount of force required to move the transparentplaten cover assembly 580 between open and closed positions.

Referring now to FIG. 6A, an exploded view of some exemplary portions ofthe book vacuum chamber assembly 590 of the imaging station 500 of someembodiments is depicted. For example, the book vacuum chamber assembly590 may include a top layer 592, a flexible sheet 593 (as shown in FIGS.6B-C), one or more spacers 594, and a bottom layer 595. In a particularembodiment, the top layer 592, one or more spacers 594, and the bottomlayer 595 may be comprised of acrylic; however, in other exemplaryembodiments, the top layer 592, one or more spacers 594, and the bottomlayer 595 may be comprised of any suitable material or materials.Additional spacers 594 can be added or removed to accommodate differentthicknesses of books; for example, an exemplary implementation, whichincludes four spacers 594, may accommodate books over four inches thick;as such, the exemplary embodiment allows for easily imaging of even thefirst few pages when the left side of the book may be ⅛″ thick and theright side may be 4″ thick. In some embodiments, the bottom layer 595includes one or more apertures 596 configured to allow air to flowfreely through the one or more apertures 596 of the bottom layer 595such that an atmospheric pressure acts on the bottom side of theflexible sheet 593.

Referring now to FIGS. 6B-6C, exemplary cross-section views of theplaten cover assembly 580 and the book vacuum chamber 590 of someembodiments of the imaging station 500 are depicted.

FIG. 6B shows an exemplary cross-sectional view of a partially openedplaten cover assembly 580 and the book vacuum chamber 590 of the imagingstation 500 of some embodiments. As shown in FIG. 6B, the platen coverassembly 580 is in a partially open position with a gap between theplaten cover assembly 580 and the book vacuum chamber 590. In someembodiments, when the vacuum assembly 570 is deactivated the flexiblesheet 593 is stretched downward against the bottom layer 595 (as shown)due to the weight of the book 560; while in other embodiments, theflexible sheet may be configured to support sufficient tensile forcessuch that the flexible sheet 593 remains suspended above the bottomlayer 595 even when supporting the book 560. In exemplary embodiments,the vacuum assembly 570 is deactivated when the platen cover assembly580 is in an open position/raised position.

In some implementations, the flexible sheet 593 is porous, perforated,or includes apertures configured to let some air through the flexiblesheet 593; in implementations which the flexible sheet 593 is porous,perforated, or includes apertures, the flexible sheet 593 is configuredto allow some air to pass through the flexible sheet 593 but stillmaintain a pressure differential (between the sealed/closed book vacuumchamber assembly and outside the flexible sheet 593) sufficient to liftthe book 560 toward the bottom surface of the transparent portion 582 ofthe transparent platen cover assembly 580 when the vacuum assembly 570is activated. In other implementations, the flexible sheet 593 issubstantially air tight such that only a negligible amount of airpermeates the flexible sheet 593.

Referring now to FIG. 6C, an exemplary cross-sectional view of a closedand/or sealed platen cover assembly 580 and the book vacuum chamber 590of the imaging station 500 of some embodiments is shown. Once the platencover assembly 580 is in a closed and sealed position, a user orautomated process (e.g., upon detection of the platen cover assembly 580being in a closed position) can trigger the activation of the vacuumassembly 570 to vacuum air from a space between the platen coverassembly 580 and the book vacuum chamber 590. As the air is vacuumed outof the space, the flexible sheet 593 raises up and conforms to the shapeof the book 560 and the pages of the book 560 are flattened and pressedagainst the bottom of the transparent portion 582 of the platen coverassembly 580. As the book 560 is pressed against the bottom surface ofthe transparent portion 582 of the transparent platen cover assembly580, the pages of the book 560 are substantially flattened into a singlefocal plane along the surface of the transparent portion 582 of thetransparent platen cover assembly 580. Pressing the pages of the book560 into the single focal plane improves the quality of images to becaptured by the camera 510 and reduces sources of image distortion alongthe center spine of the book where the pages of an opened book meet.

Once the pages of the book 560 are imaged by the camera 510, a vacuumcycle progresses towards completion, and the vacuum assembly 570 isdeactivated causing the book 560 and the flexible sheet 593 to drop downand away from the transparent platen cover assembly 580 (to a state assimilarly shown in FIG. 6B). Once vacuum assembly 570 is deactivated,the transparent platen cover assembly 580 may be raised (e.g., by auser, an automated process, or a mechanized process). Once thetransparent platen cover assembly 580 is raised, the position laser mayemit a laser pattern 591 onto the book 560 and/or flexible sheet 593(e.g., as shown in FIG. 5C) for setting up another image capture of thebook 560, and a new imaging cycle/vacuum cycle can begin.

Exemplary embodiments of the imaging station 500 are configured toaccommodate and operate with books weighing as much as 20 pounds ormore. Additionally, exemplary imaging stations 500 is configured toperform an imaging cycle of positioning and imaging two pages of a book560 every 12 seconds or less (which results in at least 300 images perhour or at least 600 pages per hour).

Referring now to FIGS. 7A-7B, a diagram of an exemplary system 700 of anexemplary embodiment of the invention is depicted. For example, in someembodiments, the system 700 includes one or more imaging stations 701,701A, 701B, 701C (which, for example, may be implemented as any ofimaging stations 100, 200, 300, 500); a network 720; a metadata/qualitycontrol portal 730; an imaging processing device or module 740; or thelike, wherein the elements of the system are communicatively coupled viathe network 720. In some embodiments, the imaging station 701 includesone or more user input devices 712A, 712B; one or more sensor devices,such as cameras 711A, 711B; one or more output devices, such as displays713A, 713B; a vacuum switch 714; at least one computing device 710; orthe like, wherein one or more of the elements of the imaging system 701are communicatively coupled.

Referring now to FIGS. 8-9, exemplary methods 800, 900 for imageprocessing of some embodiments are depicted.

Referring now to FIG. 8, an exemplary method 800 for performing initialimage processing of some embodiments is depicted. It is contemplatedthat embodiments of the method 800 can be performed by one or more ofthe following: an apparatus, a system, an apparati of a system, asub-system of a system, an imaging station 701, one or more elements ofan imaging station 701, a computing device (such as computing device 710of the imaging station 701 or another computing device connected to thenetwork 720); at least one component, integrated circuit, controller,processor, or module of a computing device (such as computing device 710of the imaging station 701 or another computing device connected to thenetwork 720); software or firmware executed on a computing device (suchas computing device 710 of the imaging station 701 or another computingdevice connected to the network 720); other computing devices (such as acomputing device of a metadata/quality control portal 730, an imageprocessing portal 740, a mobile computing device, or the like); othercomputer components; or on other software, firmware, or middleware of asystem topology. The method 800 can include any or all of steps 841,842, 843, 844, 845, 846, 847, 848, 849, 850, 851, 852, 853, 854, 855,856, 857, 858, 859, 860, 810, 820, 830, and/or 870, and it iscontemplated that the method 800 includes additional steps as disclosedthroughout, but not explicitly set forth in this paragraph. Further, itis fully contemplated that the steps of the method 800 can be performedconcurrently, sequentially, or in a non-sequential order. Likewise, itis fully contemplated that the method 800 can be performed prior to,concurrently, subsequent to, or in combination with the performance ofone or more steps of one or more other methods disclosed throughout.

Some embodiments of the method 800 include a step 841, wherein the step8841 comprises opening a raw image file. Additionally, some embodimentsof the method 800 include a step 842, wherein the step 842 comprisesreading dimensional data of the raw image file. Some embodiments of themethod 800 also include the step 843, which comprises determiningwhether a resolution and aspect ratio of the raw image file isacceptable (e.g., correct or accurate). If the resolution and the aspectratio of the image are not acceptable, then the image file is rejected(step 857). Additionally, some embodiments of the method 800 include thestep 844, which comprises applying a light filter (such as a customlight filter) upon determining that the resolution and the aspect ratioof the image are acceptable. Some embodiments of the method 800 includethe step 845, which comprises creating a copy of the raw image andplacing the image copy on top of the raw image. Some embodiments of themethod 800 further include the step 846, which comprises converting theimage copy to a bitonal format. Some embodiments of the method 800include the step 847, which comprises determining whether controltargets (e.g., control strips 470) are present in the image file and/orimage copy. If the control targets are determined to be absent, then theimage file is rejected (step 857). Some embodiments of the method 800additionally include the step 848, which comprises measuring controltarget attributes (e.g., location, orientation, sharpness, focus, or thelike). Some embodiments of the method 800 further include the step 849,which comprises determining whether the control target attributes arewithin predetermined allowable specifications. If the control targetsattributes are determined to be outside of the predetermined allowablespecifications, the method further includes a step 858 of determiningwhether the image file can be corrected. If it is determined that theimage file cannot be corrected, then the image file is rejected (step857); however, if the image file is determined to be correctable, thenthe method includes a step 859 of correcting one or more imageattributes to at least within the predetermined allowablespecifications. Some embodiments of the method 800 include the step 850,which comprises discarding the image copy (e.g., the bitonal formatimage copy) upon (a) a determination that the control target attributesare within the predetermined allowable specifications or upon (b)correcting one or more image attributes to at least within thepredetermined allowable specifications. Some embodiments of the method800 include the step 851, which comprises performing a static crop ofthe raw image based on the control target attributes (e.g., attributesof the control strips 470). Some embodiments of the method 800 alsoinclude the step 852, which comprises creating a copy of the croppedimage and placing the image copy on top of the cropped image. Someembodiments of the method 800 further include the step 853, whichcomprises measuring a background color of the image file and/or imagecopy, wherein the background color of the image file corresponds to thecolor of the “out of gamut” substrate 420. Some embodiments of themethod 800 include the step 854, which comprises detecting and selectingall contiguous background color pixels based upon the measuredbackground color of the image file and/or image copy. Some embodimentsof the method 800 further include the step 855, which comprisesdetermining whether similar background pixels are required to beselected, wherein similar background pixels are background pixels with asame pixel color or approximately the same pixel color as the selectedcontiguous background color pixels but may be non-contiguous to theselected contiguous background pixel colors. If a selection of similarbackground pixels is not required, then method 800 for the initialimaging process will continue on with the performance of step 860. Onthe other hand, in some embodiments, the method 800 includes the step856, which comprises selecting some or all of the similar backgroundpixels upon a determination that a selection of similar backgroundpixels is required. Some embodiments of the method 800 include the step860, which comprises creating a mask (sometimes referred to as an alphachannel; described in more detail with reference to FIGS. 15D-G) fromthe selected background pixels (e.g., from the selected contiguousbackground pixels or from the selected contiguous background pixels andthe similar background pixels). Some embodiments of the method 800include the step 810, which comprises creating a temporary qualitycontrol (QC) thumbnail file upon creation of the mask. The temporaryquality control thumbnail file may be used by a quality control process910 in method 900, as described below. Some embodiments of the method800 include the step 820, which comprises creating a temporary metadatathumbnail file (e.g., a temporary metadata portal thumbnail file) uponcreation of the mask. The temporary metadata thumbnail file may be usedby a metadata process 920 in the method 900, as described below. Someembodiments of the method 800 include the step 830, which comprisescreating a high resolution archive file upon creation of the mask. Thehigh resolution archive file may be used by an optical characterrecognition (OCR) optimization process 930 and an image optimizationprocess 940 in the method 900, as described below. Some embodiments ofthe method 800 include the step 870, which comprises completing theinitial image processing.

Referring now to FIG. 9, an exemplary method 900 for image processing ofsome embodiments is depicted. It is contemplated that embodiments of themethod 900 can be performed by one or more of the following: anapparatus, a system, an apparati of a system, a sub-system of a system,an imaging station 701, one or more elements of an imaging station 701,a computing device (such as computing device 710 of the imaging station701 or another computing device connected to the network 720); at leastone component, integrated circuit, controller, processor, or module of acomputing device (such as computing device 710 of the imaging station701 or another computing device connected to the network 720); softwareor firmware executed on a computing device (such as computing device 710of the imaging station 701 or another computing device connected to thenetwork 720); other computing devices (such as a computing device of ametadata/quality control portal 730, an image processing portal 740, amobile computing device, or the like); other computer components; or onother software, firmware, or middleware of a system topology. The method900 can include any or all of steps 910, 911, 912, 913, 914, 920, 921,922, 923, 924, 930, 931, 932, 933, 934, 940, 941, 942, 943, 944, 945,946, 950, 953, 954, 955, 958, 959, 961, and/or 962, and it iscontemplated that the method 900 includes additional steps as disclosedthroughout, but not explicitly set forth in this paragraph. Further, itis fully contemplated that the steps of the method 900 can be performedconcurrently, sequentially, or in a non-sequential order. Likewise, itis fully contemplated that the method 900 can be performed prior to,concurrently, subsequent to, or in combination with the performance ofone or more steps of one or more other methods disclosed throughout.

As shown in FIG. 9, some embodiments of the method 900 may includeperformance of a plurality of processes and/or sub-processes; forexample, the method 900 may include performance of a quality controlprocess 910, a metadata process 920, an OCR optimization process 930, animage optimization process 940, and a file reassembly process 950. Inexemplary embodiments, the method 900 may be performed as a plurality ofiterations on a plurality of document images, wherein each iteration ofthe method 900 processes one or more images and/or one or more filesassociated with a particular imaged document.

In some embodiments, the quality control process 910 includesperformance of a plurality of sub-processes. In exemplary embodiments,the quality control process 910 performs quality control operations onthe temporary quality control thumbnail file 810. The quality controlprocess 910 may include a step 911 of performing an automated qualitycontrol process on the temporary quality control thumbnail file 810. Thequality control process 910 may also include a step 912 of performing amanual quality control process; for example, the step 912 may includepresenting (e.g., displaying) an image or file (e.g., the temporaryquality control thumbnail file 810) to a user (e.g., a quality controluser) on a computing device (e.g. metadata/quality control portal 730)to inspect the temporary quality control thumbnail file 810 andreceiving one or more inputs (e.g., selections, approval, rejections,flagging an image, notations, or the like) from the user upon presentingthe image or file to the user. The quality control process 910 may alsoinclude a step 913 of performing a final manual vetting (e.g. visualexamination) of the temporary quality control thumbnail file 810 toapprove the image. The quality control process 910 may additionallyinclude a step 914 of writing the quality control results to a database.

In some embodiments, the metadata process 920 includes performance of aplurality of sub-processes. In exemplary embodiments, the metadataprocess 920 performs metadata processing operations on the temporarymetadata thumbnail file 820. The metadata process 920 may include a step921 of gathering data from the file name (e.g., the temporary metadatathumbnail file 820). The metadata process 920 may also include a step922 of gathering data from a file hierarchy. The metadata process 920may further include a step 923 of gathering data from a database. Themetadata process 920 may additionally include a step 924 of writingmetadata (e.g., gathered data) to the database.

In some embodiments, the optical character recognition (OCR)optimization process 930 includes performance of a plurality ofsub-processes. In exemplary embodiments, the optical characterrecognition (OCR) optimization process 930 performs OCR optimizationprocessing operations on the high resolution archive file 830 or a copyof the high resolution archive file 830. The optical characterrecognition (OCR) optimization process 930 may include a step 931 ofconverting the high resolution archive file 830 or a copy of the highresolution archive file 830 to a high contrast bitonal image file. Theoptical character recognition (OCR) optimization process 930 may alsoinclude a step 932 of converting the high contrast bitonal image file toa higher resolution. The optical character recognition (OCR)optimization process 930 may further include a step 933 of convertingthe higher resolution high contrast bitonal image file to a temporaryPDF (portable document format) file. The optical character recognition(OCR) optimization process 930 may additionally include a step 934 ofstoring the temporary PDF file.

In some embodiments, the image optimization process 940 includesperformance of a plurality of sub-processes. In exemplary embodiments,the image optimization process 940 performs image optimizationprocessing operations on the high resolution archive file 830 or a copyof the high resolution archive file 830. The image optimization process940 may include a step 941 of improving sharpness of text (e.g.,optimizing text for sharpness). The image optimization process 940 mayalso include a step 942 of improving image smoothness (e.g., optimizingpictures for smoothness). The image optimization process 940 may furtherinclude a step 943 of reducing a size of the high resolution archivefile 830 or a copy of the high resolution archive file 830 (e.g.optimizing an entire image for minimal size). The image optimizationprocess 940 may additionally include a step 944 of creating and/orstoring an optimized PDF image file upon performance of one or more(e.g., one, two, or three) of steps 941, 942, 943. The imageoptimization process 940 may further include a step 945 of creating andstoring a large thumbnail file upon performance of one or more (e.g.,one, two, or three) of steps 941, 942, 943. Additionally, the imageoptimization process 940 may include a step 946 of creating and storinga small thumbnail file upon performance of one or more (e.g., one, two,or three) of steps 941, 942, 943.

In some embodiments, the file reassembly process 950 includesperformance of a plurality of sub-processes. In exemplary embodiments,the file reassembly process 950 performs file reassembly operations onone or more files loaded, used, written, created, and/or stored duringperformance of either or both the method 800 or the method 900. The filereassembly process 950 may include a step 953 of combining the temporaryPDF file (e.g., which was created, stored, and/or modified in step 934)with the optimized PDF image file (e.g. which was created, stored,and/or modified in step 944) to create a new optimized PDF image file.The file reassembly process 950 may also include step 955 of gathering(e.g., reading) metadata and/or filename data from the database andcreating a new PDF file which contains the gathered data (e.g., thegathered metadata). The file reassembly process 950 may also includesteps 954 and 958 of creating a final optimized PDF file which includesmetadata based on the new optimized PDF image file and the gathered data(from step 955). In some embodiments, the steps 954, 955, 958 may becombined into a single step, which may, for example, comprise creating afinal optimized PDF file based at least upon (a) the new optimized PDFimage file and (b) metadata gathered from the database; similarly, forexample, the steps 954, 955, 958 may be combined into a single step ofcreating a final optimized PDF file by incorporating the gatheredmetadata into the new optimized PDF image file. The file reassemblyprocess 950 may additionally include a step 959 of verifying (e.g.,checking) whether a particular image associated with the final optimizedPDF file has been approved during the quality control process 910 byaccessing the quality control results for the particular image whichwere written to the database (in step 914). The file reassembly process950 may include a step 961: upon a verification that the particularimage associated with the final optimized PDF file has been approved,all associated files (e.g., the final optimized PDF file, anythumbnails, or the like) associated with the particular approved imageproceed to a final archive. The file reassembly process 950 may includea step 962: if the particular image associated with the final optimizedPDF file has not been approved, the raw image is corrected and allassociated files are reprocessed.

Referring now to FIG. 10, an exemplary method 1000 for performingquality control processes of some embodiments is depicted. It iscontemplated that embodiments of the method 1000 can be performed by oneor more of the following: an apparatus, a system, an apparati of asystem, a sub-system of a system, an imaging station 701, one or moreelements of an imaging station 701, a computing device (such ascomputing device 710 of the imaging station 701 or another computingdevice connected to the network 720); at least one component, integratedcircuit, controller, processor, or module of a computing device (such ascomputing device 710 of the imaging station 701 or another computingdevice connected to the network 720); software or firmware executed on acomputing device (such as computing device 710 of the imaging station701 or another computing device connected to the network 720); othercomputing devices (such as a computing device of a metadata/qualitycontrol portal 730, an image processing portal 740, a mobile computingdevice, or the like); other computer components; or on other software,firmware, or middleware of a system topology. The method 1000 caninclude any or all of steps 1010, 1011, 1012, 1013, 1014, 1015, 1016,1020, 1021, 1022, 1023, 1024, 1025, 1026, 1027, and/or 1030, and it iscontemplated that the method 1000 includes additional steps as disclosedthroughout, but not explicitly set forth in this paragraph. Further, itis fully contemplated that the steps of the method 1000 can be performedconcurrently, sequentially, or in a non-sequential order. Likewise, itis fully contemplated that the method 1000 can be performed prior to,concurrently, subsequent to, or in combination with the performance ofone or more steps of one or more other methods disclosed throughout.

As shown in FIG. 10, some embodiments of the method 1000 may includeperformance of a plurality of processes and/or sub-processes; forexample, the method 900 may include performance of an automated qualitycontrol process 1010 and a manual quality control process 1020. Inexemplary embodiments, the method 1000 may be performed as a pluralityof iterations on a plurality of document images, wherein each iterationof the method 1000 is associated with one or more images and/or one ormore files associated with a particular imaged document.

In some embodiments, the automated quality control process 1010 includesperformance of a plurality of sub-processes. In some exemplaryembodiments, the automated quality control process 1010 comprises thestep 911 of the method 900 of performing an automated quality controlprocess on the temporary quality control thumbnail file 810. Referringto FIG. 10, in exemplary embodiments, the automated quality controlprocess 1010 performs quality control operations on a quality controlimage. The automated quality control process 1010 may include a step1011 of receiving, opening, or loading a quality control image (e.g., atemporary quality control thumbnail file 810). The automated qualitycontrol process 1010 may also include a step 1012 of reading a mask areaand border pixels. The automated quality control process 1010 mayfurther include a step 1013 of determining whether a mask to backgroundratio is acceptable (e.g., a ratio of a number of mask pixels to anumber of background pixels is above or below a predetermined thresholdvalue). If the mask to background ratio is acceptable, the automatedquality control process 1010 includes a step 1015 of processing a nextimage or a step 1016 of terminating the automated quality controlprocess 1010 if there are no more image files to process. However, ifthe mask to background ratio is determined to be unacceptable, theautomated quality control process 1010 includes a step 1014 ofdetermining whether a border pixel ratio is acceptable (e.g., a ratio ofa number of background pixels to a number of non-background pixels isabove or below a predetermined threshold value). If the border pixelratio is acceptable, the automated quality control process 1010 includesa step 1015 of processing a next image or a step 1016 of terminating theautomated quality control process 1010 if there are no more image filesto process. However, if the background pixel ratio is determined to beunacceptable, the automated quality control process 1010 includes a step1030 of quarantining the quality control image.

In some embodiments, the manual quality control process 1020 includesperformance of a plurality of sub-processes. In some exemplaryembodiments, the manual quality control process 1020 comprises the step912 of the method 900 of performing a manual quality control process onthe temporary quality control thumbnail file 810. Referring to FIG. 10,in exemplary embodiments, the manual quality control process 1020performs quality control operations on a quality control image. Themanual quality control process 1020 may include a step 1021 ofreceiving, opening, or loading a particular quality control image (e.g.,a temporary quality control thumbnail file 810). The manual qualitycontrol process 1020 may include a step 1022 of determining whether thedocument orientation is correct (or substantially correct such that anautomated process can rotate the image by less than 45 degrees tocorrect the document orientation); the step 1022 may include receiving auser input from a user with a selection which indicates whether theorientation is correct. The manual quality control process 1020 mayinclude a step 1023 of determining whether all of the document elementsare present; the step 1023 may include receiving a user input from auser with a selection which indicates whether all of the documentelements are present. The manual quality control process 1020 mayinclude a step 1024 of determining whether the non-document elementshave been removed; the step 1024 may include receiving a user input froma user with a selection which indicates whether the non-documentelements have been removed. The manual quality control process 1020 mayalso include a step 1025 of determining whether document legibility isacceptable; the step 1025 may include receiving a user input from a userwith a selection which indicates whether document legibility isacceptable. Additionally, the manual quality control process 1020 mayinclude a step 1030 of quarantining a particular image if any of thesteps 1022, 1023, 1024, or 1025 result in negative or unacceptableresponses. However, if all of the steps 1022, 1023, 1024, or 1025 resultin positive or acceptable responses, the manual quality control process1020 includes a step 1027 of processing a next image or a step 1026 ofterminating the manual quality control process 1020 if there are no moreimage files to process.

Referring now to FIGS. 11A-D, exemplary depictions associated withperforming coarse image measurement processes of some embodiments areshown. Performance of coarse image measurement processes prepares theimage for subsequent software measurement processes.

Referring now to FIGS. 11A and 11B, an exemplary extended documentimaging area 1100A is shown. The extended document imaging area 1100A(e.g., an area corresponding to the top side of the transparent vacuuminsert assembly 401 with a document 480 placed on top of a portion ofthe transparent vacuum insert assembly 401) may include areas outside ofwhere a camera (e.g., 110, 310A, 310B, 510) will capture an image. Inexemplary embodiments, in order to effectively “isolate” a document, itcan be determined approximately where the document is within the imagearea 1100B (see FIG. 11B). The image area 1100B refers to the fulldigital image as captured and delivered from the camera; this fulldigital image is also referred to as the raw image. FIG. 11A depicts theimage area 1100B as it relates to the surrounding extended documentimaging area 1100A. The shaded portion of the extended document imagingarea 1100A around the image area 1100B represents the physical areawhich is outside a view of the camera. In some embodiments, the cameramay be calibrated or configured to have an image area 1100B whichcaptures only a portion (e.g., approximately half) of each solid colorbar 470A of the control strips 470.

In order to establish a coarse location of the document 480, someembodiments include creating an image copy which can be manipulated formeasurement purposes; for example, this determination may includeperformance of step 842 of reading dimensional data and performance ofstep 843 of determining whether the resolution and aspect ratio iscorrect of the method 800. The image copy may be placed directly on topof the raw image and is typically discarded later. The image copy maythen be converted to a grayscale image. The grayscale image may containpixels that vary from light to dark but do not contain a color value. Inexemplary grayscale images, any given pixel can have an integral valueof between 0 and 255 (i.e., 8-bit); however, embodiments of theinvention are not limited to 8-bit grayscale pixel values and may beimplemented with any other suitable number of grayscale pixel values.Additionally, grayscale image can be further manipulated to result in ablack and white image 1100D, where all of the pixel values are either 0(e.g., black) and 1 (e.g., white).

Referring now to FIG. 11C, in some embodiments which are implementedwith a document imaging area that includes vacuum holes, the grayscaleimage may be manipulated to remove the vacuum holes 451 from thegrayscale image to result in a de-holed grayscale image 1100C. While thevacuum holes 451 force the document 480 flat and secure the document 480during image capture, the vacuum holes 451 can be detrimental to imageprocessing. A series of image contractions and expansions may beperformed to remove the vacuum holes 451 from the grayscale image.

Referring now to FIG. 11D, an illustration of a bitonal image 1100D(e.g., a black and white image, as described above with reference toFIGS. 11A-B), which is converted from the grayscale image, is depicted(see also step 846 of converting the image copy to bitonal format of themethod 800). The bitonal image has pixel values which are one of onlytwo values (e.g., 0 or 1).

Referring now to FIGS. 12A-J, exemplary depictions associated withperforming fine image measurement processes of some embodiments areshown.

Referring now to FIG. 12A, illustration 1200A shows the key areas withinthe image which are to be measured for determining positioning values orcoordinates. The circled areas of illustration 1200A denote edgesbetween black portions of each control strip 470 and lighter (e.g.,white) portions of the image which may be used to determine documentlocation. The precise location of a document 480 within the image may bedetermined by calculating a value of black pixels versus white pixels ina given area. These calculated values may also be used to measure imageintegrity and quality.

Referring now to FIG. 12B, illustration 1200B demonstrates how documentsize is determined. The value to be measured is the relative size of theimage. All of the raw images of a particular camera have a fixed amountof pixels in height and width. While the number of pixels in width andheight is known, the actual size of the document or object which iscaptured by the camera may be unknown. Because an image is projectedonto a camera's sensor by a lens, it may be unknown to what extent thelens was zoomed-in when it captured the raw image. While the actual sizeof the document or object may be unknown, the actual size of thedocument or object may be determined based on known information withrespect to the control strips 470. For example, the actual width of thesolid color bars of each control strip 470 is predetermined and knownbased on a known location of each color strip 470 and known location ofthe components (e.g., 470A, 470B, 470C) on each control strip 470.Therefore, the size of the imaged area and the document image may bedetermined by measuring the solid color bar 470A of a particular controlstrip 470. Likewise, a dots per inch (dpi) value or pixels per inch(ppi) value may be computed in a similar manner, and these values may beused to determine a scale of the image for measuring distances,subsequent image correction processes, and final document preparationprocesses.

In some embodiments, the scale of the image may be used for mapping theentire image. This “map” allows processes to perform quality correctionsand may provide the location of key static targets (e.g., control strips470, and components 470A, 470B, 470C of the control strips).

Referring now to FIG. 12C, an illustration 1200C demonstrates that theorigin (i.e., a point of intersection of the coordinate axes) of thex-axis is determined by measuring from a corner pixel of the image to ahorizontal (x-axis) centerline, which is calculated by dividing pixeldimensions by two.

Referring now to FIG. 12D, an illustration 1200D demonstrates that theorigin of the y-axis is determined by measuring from the corner pixel ofthe image to a vertical (y-axis) centerline, which is calculated bydividing pixel dimensions by two.

Referring now to FIGS. 12E-F, illustrations 1200E, 1200F demonstratethat skew factors may be determined by measuring and comparing thedistance from the ends of the control strips 470 to corner points of theimage. For example, this may include measuring a strip of pixels acrossthe top of an image where the control target is known to be located; thenumber of black pixels and the number of white pixels can be recorded,and the black pixels will be the solid color bar of the control strips470, and the white pixels may be the background.

Referring now to FIG. 12G, FIG. 12H, and FIG. 12I, illustrations 1200G,1200H, 1200I demonstrate the measuring of additional points within theimage.

Referring now to FIG. 12J, illustration 1200J demonstrates a grid thatmay be created from the various measured points (as shown in FIGS.12A-I).

In some embodiments, performing coarse image measurement processes(e.g., as described with respect to FIGS. 11A-D) and/or performing fineimage measurement processes (e.g., as described with respect to FIGS.12A-J) may include measuring image attributes by performing statisticalanalysis operations on the image. For example, in some embodiments,performing statistical analysis operations on the image may be performedbased on static control targets (e.g., an image of the control strips470) within the image and known image file attributes (e.g., the pixeldimensions of the image, the pixel value of each pixel, the pixelcoordinates of particular pixels, or the like). For example, performinga statistical analysis operation may include creating and evaluating ahistogram of the pixel values of an image to determine the distributionof pixel values throughout the image. Each pixel has a particular pixelvalue (e.g., an integral value, such as between 0 and 255 for an 8-bitgrayscale image). The created histogram contains a distribution ofpixels by pixel darkness (corresponding to the pixel value). Performinga statistical analysis operation may include evaluating the distributionof pixels by pixel value to determine statistical metrics (e.g., mean,median, mode, standard deviation, variance, or the like) of thedistribution; for example, the average and median color of the image canbe determined from the statistical metrics. In some embodiments,performing a statistical analysis operation further includes applying athreshold filter to the histogram for generation of a bitonal image. Insome embodiments, the threshold filter may be automatically or manuallyset to a particular value (e.g., predetermined, default, or variable)based on one or statistical metrics obtained from the distribution ofpixels; for example, threshold filter may be automatically or manuallyset to a particular value based on the median and mean pixel values. Forexample, an exemplary threshold filter (e.g., a threshold light filter)may be configured to set every pixel with a value of 127 or below tozero, and every pixel with a value of 128 and above to 255 to create abitonal image with pixel values of 0 or 255.

Additionally, performing statistical analysis operations providesinformation to processes for making adjustments to correct for variables(e.g., lighting, camera position, color casts, lens aberrations, or thelike).

Referring now to FIGS. 13A-F, exemplary depictions associated withperforming automated image integrity corrections of some embodiments areshown. In some embodiments, performing automated image corrections mayinclude comparing measurements (e.g. fine image measurements) againstthe predetermined standards based on data from the fine imagemeasurements. If an image is within control limits, the image advances;however, if the image is outside of the control limits but withinsoftware correction limits, it will be corrected (see also, step 849 ofmethod 800, determining whether the target attributes are within thespecifications). For example, embodiments of the invention areconfigured to perform x,y origin corrections as shown in illustration1300A of FIG. 13A. Embodiments of the invention may be configured toperform skew corrections as shown in illustration 1300B of FIG. 13B.Embodiments of the invention may be configured to perform horizontalkeystone corrections as shown in illustration 1300C of FIG. 13C.Embodiments of the invention may be configured to perform verticalkeystone corrections as shown in illustration 1300D of FIG. 13D.Embodiments of the invention may be configured to perform pincushiondistortion corrections as shown in illustration 1300E of FIG. 13E.Additionally, embodiments of the invention may be configured to performbarrel distortion corrections as shown in illustration 1300F of FIG.13F.

Referring now to FIGS. 14A-E, exemplary depictions associated withperforming coarse document isolation operations of some embodiments areshown. In some embodiments, coarse document isolation operations may beperformed upon performance of coarse image measurement operations, fineimage measurement operations, and automated image corrections. FIGS.14A-E depict the isolation (e.g. coarse isolation) of the live document480 from the overall image.

Referring now to FIG. 14A, a cropped image 1400A is depicted whichincludes a live document 480 portion and a background portion 1410(e.g., which may be associated with the out of gamut substrate 420). Thecropped image 1400A represents the state of the processed imagefollowing a static crop (see also step 851 of the method 800) of theimage to remove the control targets (e.g., control strips 470) from theimage.

Referring now to FIG. 14B, a bordered image 1400B is depicted. Thebordered image may comprise a live document 480 portion, a backgroundportion 1410, and a border portion 1420 (created from a portion of thebackground portion 1410 around the edge of the cropped image 1400A). Inorder to precisely separate the out of gamut substrate color of thebackground portion 1410 from the live document 480 portion, thebackground color may be sampled (e.g., measured; see also step 853 ofthe method 800). Inadvertently measuring any part of the live document480 portion would contaminate the sampling of the background color. Toprevent contamination during the sampling of the background color, theprocess includes sampling the color of a border portion 1420 of thebackground portion 1410.

Referring now to FIGS. 14C, 14D, and 14E, a border filled image 1400C,1400D are depicted. The coarse document isolation process may include anoperation to remove the holes from the border portion 1420, 1430 and anoperation to fill the border portion 1420 with an average backgroundcolor measured within the border portion 1420. The filled border portionof the border filled image 1400C, 1400D may be grown to select similarlycolored portions of the background portion 1410 which results in theselected background portion 1440, as shown in the image 1400E of FIG.14E; that is, the average fill of the border portion 1420 is expanded toother similarly colored areas of the background portion 1410 (e.g.,selecting contiguous pixels or similar pixels; see also steps 854, 855,856 of the method 800). In some embodiments, the expansion of theaverage fill into the background portion 1440 may cut into the edges ofthe live document portion 480, so a contraction (i.e., by a size lessthan the size of the expansion) may be performed to reverse the effects(e.g., edge loss of the live document portion 480) of the expansion. Asshown in FIG. 14E, an isolated (e.g., coarsely isolated) document 480 ofthe image 1400E is depicted.

Referring now to FIGS. 15A-G, exemplary depictions associated withperforming fine document isolation of some embodiments are shown. Insome embodiments, fine document isolation operations may be performedupon performance of coarse document isolation operations. FIGS. 15A-Gdepict the isolation of a document 1501 of an overall image 1500A.

Referring now to FIG. 15A, a coarsely isolated image 1500A of a document1501 and background 1510 is shown. In some embodiments, the document maybe an old archive document, which includes damage to the edges,hole-punches, or other conditions. As shown in FIG. 15A, the coarselyisolated image 1500A is represented as being at a state following theperformance of coarse document isolation operations, as described withreference to FIGS. 14A-E.

Referring now to FIG. 15B, a trimmed image 1500B is shown. The processincludes trimming the coarsely isolated image 1500A on all sides (e.g.,all four sides) until a first pixel is reached on each side of thedocument 1501 without eliminating any pixels of the document 1501. Itshould be noted that there still may be some background color in theportion 1511 and in the hole-punches of the document 1501.

Referring now to FIG. 15C, a rotated image 1500C is shown. The processincludes rotating the trimmed image 1500B to a correct orientation(e.g., as shown in FIG. 15C). In some embodiments, the correctorientation may be determined by detecting the activation of 2 LEDs(light emitting diodes) which may be located adjacent to the controlstrips 470. The LEDs may be toggled on and off and may be detected atthe same time the document coordinates are measured.

Referring now to FIG. 15D, a masked image 1500D is shown. The maskedimage may include a mask 1502. The imaging process may include creatingthe mask 1502 based upon the selections of the coarse document isolationoperations and other fine document isolation operations. (The mask 1502is sometimes referred to as an alpha channel; for example, at this pointthe image file may contain four channels: a red channel, a blue channel,a green channel, and the alpha channel (mask)). The isolation process isconfigured to mask out the background areas of the rectangular image andanalyze only pixels that are part of the document; this enables theprocess to map or fit the color space of each and every document to thefull color space of its intended rendering device (e.g., a computerdisplay, a printer, or the like). In some embodiments the process canperform accurate tone and color cast corrections based on this maskeddata, which results in full contrast and color balanced documents.

Referring now to FIG. 15E, an image 1500E is shown. The image processingmay include removing the background fill color of the background portion1512 of the image 1500E by utilizing the mask 1502. For example, theimage processing may include loading the alpha channel (mask 1502) fromthe image and creating a polygon selection. The complexity of thepolygon may be related to the edges of the document 1501. The polygonselection may either target the document 1501 or non-document area 1512(which is the remaining background of the image 1500E). For example,removing the background fill color of the background portion 1512 mayinclude replacing the background fill color (e.g., green) with adifferent color (e.g., light gray). In some embodiments, additionalvisual effects (e.g., a drop shadow, or the like) may be added to theimage for various visual impacts.

Referring now to FIGS. 15F-G, exemplary images 1500F, 1500G of someembodiments, which are configured to perform edge healing, are depicted.Some embodiments include analyzing background color of the document 1501along edges 1501A of the mask 1502. In some embodiments, the edges maybe a single pixel thick; however, in other embodiments the edges may bemultiple pixels thick. In some embodiments, performing the edge healingincludes determining an average document background color (e.g.,corresponding to non-ink portions of the document 1501) by analyzing adistribution of pixel values. In some embodiments, performing the edgehealing then includes filling the non-document area 1510 with theaverage document background color to create an edge healed document 1503as shown in the edge healed image 1500G shown in FIG. 15G.

Additionally, some embodiments include removing document staining (e.g.,yellow stains on aged documents). For example, an average documentbackground color (e.g., corresponding to non-ink portions of thedocument 1501) may be determined by analyzing a distribution of pixelvalues. The average document background color (e.g., corresponding tonon-ink portions of the document 1501) may be lightened and then filledinto the non-ink portions of the document 1501.

Additionally, some embodiments of the invention include a method fornumerically measuring a degree of focus and/or sharpness of an imagebased on at least one static control target. Because image sharpness andfocus are typically subject to a person's subjective perception, it canbe difficult for a computer process to measure a degree focus and/orsharpness of an image; however, some embodiments of the inventioninclude a method for numerically measuring a degree of focus and/orsharpness of an image based on static control targets. The method mayinclude determining a location of a light-dark pattern area (e.g.,light-dark pattern bar 470C) of at least one control target (e.g., atleast one control strip 470) within an image captured by an imagingstation, wherein the light-dark pattern area includes a plurality ofsegments arranged linearly from a first side of the light-dark patternarea to an opposite side of the light-dark pattern area, each of theplurality of segments including a geometrically similar repeatingpattern of light pixel areas and dark pixel areas, wherein a size ofeach light pixel area and each dark pixel area of each particularsegment progressively decreases from the first side of the light-darkpattern area to the opposite side of the light-dark pattern area. Themethod may also include determining pixel values of the pixels of thelight-dark pattern area. The method may also include, upon determiningthe pixel values of the pixels of the light-dark pattern area,determining a ratio of dark to light pixels for each of the plurality ofsegments of the light-dark pattern area. The method may also includedetermining which of the plurality of segments of the light-dark patternarea exceed a predetermined dark-to-light pixel ratio threshold. Themethod may also include determining a degree of focus based on thesegments having ratios of dark to light pixels which exceed apredetermined dark-to-light pixel ratio threshold (e.g., determining adegree of focus based on a number of the segments having ratios of darkto light pixels which exceed a predetermined dark-to-light pixel ratiothreshold). The method may further include determining whether themeasured focus is acceptable based upon a predetermined focus thresholdvalue. The method may also include, upon a determination that themeasured focus is unacceptable, adjusting a lens focus of a camera ofthe imaging station and recapturing the image of the document.

It is believed that embodiments of the present invention and many of itsattendant advantages will be understood by the foregoing description,and it will be apparent that various changes can be made in the form,construction, and arrangement of the components thereof withoutdeparting from the scope and spirit of the invention or withoutsacrificing all of its material advantages. The form herein beforedescribed being merely an explanatory embodiment thereof, it is theintention of the following claims to encompass and include such changes.

What is claimed is:
 1. A method for performing image processing of aphysical document image, comprising: receiving an image file comprisingan image of a physical document and at least one control target;measuring control target attributes of the at least one control target,wherein measuring the control target attributes of the at least onecontrol target includes: numerically measuring a degree of focus of theimage file based on the at least one control target, wherein numericallymeasuring the degree of focus of the image file based on the at leastone control target comprises: determining a location of a light-darkpattern area of the at least one control target; determining pixelvalues of the light-dark pattern area; determining a ratio of dark tolight pixels for each of a plurality of segments of the light-darkpattern area; determining which of the plurality of segments of thelight-dark pattern area exceed a predetermined dark-to-light pixel ratiothreshold; and determining the degree of focus based on a number ofsegments that exceed the predetermined dark-to-light pixel ratiothreshold; and performing a static crop of the image file based on thecontrol target attributes by removing the at least one control targetfrom the image file to create a cropped image.
 2. The method of claim 1,further comprising: receiving the image file from an imaging station. 3.The method of claim 2, wherein the imaging station comprises a bookimaging station.
 4. The method of claim 2, wherein the imaging stationcomprises a microfiche card imaging station.
 5. The method of claim 2,wherein the imaging station comprises an imaging station which includesa vacuum assembly.
 6. The method of claim 1, wherein the at least onecontrol target comprises at least one control strip.
 7. The method ofclaim 6, wherein the light-dark pattern area comprises a light-darkpattern bar.
 8. The method of claim 1, wherein the light-dark patternarea includes light pixel areas and dark pixel areas, wherein sizes ofthe light pixel areas and the dark pixel areas progressively decreasefrom one side of the light-dark pattern area to an opposite side of thelight-dark pattern area.
 9. The method of claim 8, wherein each of theplurality of segments of the light-dark pattern area include ageometrically similar repeating pattern of the light pixel areas and thedark pixel areas.
 10. The method of claim 1, wherein the image comprisesa bitonal image.
 11. The method of claim 1, further comprising:determining whether the numerically measured degree of focus isacceptable based upon a predetermined focus threshold value.
 12. Themethod of claim 11, further comprising: upon a determination that thenumerically measured degree of focus is unacceptable, adjusting a lensfocus of a camera of an imaging station and recapturing the image of thephysical document and the at least one control target.
 13. The method ofclaim 1, wherein the image file is a raw image file.
 14. The method ofclaim 13, further comprising: creating an image copy of the raw imagefile; and converting the image copy to a bitonal format.
 15. The methodof claim 14, further comprising: verifying that the at least one controltarget is present in the image copy.
 16. The method of claim 1, furthercomprising: creating a cropped image copy of the cropped image;measuring a background color of a background of the cropped image copyupon creating the copy of the cropped image, wherein the background isseparate from a live document portion of the cropped image copy;selecting contiguous background color pixels of the background basedupon the measured background color; and creating a mask based at leaston the selected contiguous background color pixels.
 17. The method ofclaim 16, further comprising: creating a temporary quality controlthumbnail file, a temporary metadata thumbnail file, and a highresolution archive file based at least on the created mask.
 18. Themethod of claim 17, further comprising: performing a quality controlprocess on the temporary quality control thumbnail file.
 19. The methodof claim 17, further comprising: performing a metadata process on thetemporary metadata thumbnail file.
 20. The method of claim 17, furthercomprising: performing an optical character recognition optimizationprocess on the high resolution archive file.
 21. The method of claim 17,further comprising: performing an image optimization process on the highresolution archive file.
 22. The method of claim 17, further comprising:performing a quality control process on the temporary quality controlthumbnail file; performing a metadata process on the temporary metadatathumbnail file; performing an optical character recognition optimizationprocess on the high resolution archive file; and performing an imageoptimization process on the high resolution archive file.
 23. The methodof claim 22, further comprising: performing a file reassembly process.24. The method of claim 23, wherein performance of the file reassemblyprocess outputs a final optimized portable document format (PDF) filewith metadata.